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Abstract: Since the birth of molecular evolutionary analysis, primates have been a central focus of study and mitochondrial DNA is 
well suited to these endeavors because of its unique features. Surprisingly, to date no comprehensive evaluation of the nucleotide 
substitution patterns has been conducted on the mitochondrial genome of primates. Here, we analyzed the evolutionary patterns and 
evaluated selection and recombination in the mitochondrial genomes of 44 Primates species downloaded from GenBank. The results 
revealed that a strong rate heterogeneity occurred among sites and genes in all comparisons. Likewise, an obvious decline in primate 
nucleotide diversity was noted in the subunit rRNAs and tRNAs as compared to the protein-coding genes. Within 13 protein-coding 
genes, the pattern of nonsynonymous divergence was similar to that of overall nucleotide divergence, while synonymous changes 
differed only for individual genes, indicating that the rate heterogeneity may result from the rate of change at nonsynonymous sites. 
Codon usage analysis revealed that there was intermediate codon usage bias in primate protein-coding genes, and supported the idea 
that GC mutation pressure might determine codon usage and that positive selection is not the driving force for the codon usage bias. 
Neutrality tests using site-specific positive selection from a Bayesian framework indicated no sites were under positive selection for 
any gene, consistent with near neutrality. Recombination tests based on the pairwise homoplasy test statistic supported complete 
linkage even for much older divergent primate species. Thus, with the exception of rate heterogeneity among mitochondrial genes, 
evaluating the validity assumed complete linkage and selective neutrality in primates prior to phylogenetic or phylogeographic 
analysis seems unnecessary. 
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The mitochondrial genome of vertebrates exhibits 
several peculiar features including maternal inheritance, 
the presence of single-copy orthologous genes, a lack of 
recombination, evolutionary neutrality, and a high 
mutation rate—characteristics that make it well suited for 
evolutionary studies (Pesole et al, 1999; Saccone et al, 
2000). Among all the characteristics and assumptions of 
the mitochondrial genome, the two most important 
aspects for explaining the evolutionary process are the 
presumed freedom from positive Darwinian (adaptive) 
natural selection and lack of recombination. 
Mitochondrial proteins are central to the cellular 
oxidative phosphorylation pathway and are functionally 
conserved across metazoan phyla (Gray et al, 1999). The 
importance of these mtDNA protein products supports 
the hypothesis that some variation in the mtDNA 
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molecule is adaptive. Nonetheless, of the numerous 
single mtDNA gene studies and several complete 
mitochondrial genome comparisons published over the 
past two decades, very few have been able to reject 
neutrality in favor of positive Darwinian selection 
(Garcia-Martinez et al, 1998; Rand et al, 1994). Indeed, 
previous studies on vertebrates indicated that different 
levels of variability are attributable to functional 
constraints and/or slightly deleterious polymorphisms, a 
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pattern consistent with near neutrality (Nachman et al, 
1996; Templeton, 1996). 

Recombination breaks down the correlation in 
genealogical history between different regions of a 
genome, which may be the incorrect inference of 
evolutionary history (Schierup & Hein, 2000). The issue 
of whether recombination occurs in the human 
mitochondrial genome remains controversial as the 
necessary enzymes for recombination are in fact present 
in the mitochondria, and a few paternal mitochondria do 
penetrate the egg during fertilization (Thyagarajan et al, 
1996), providing evidence that suggests recombination is 
possible, at least in humans. Recent broad surveys of 
animal mitochondrial genomes have also concluded that 
recombination is widespread (Piganeau et al, 2004; 
Tsaousis et al, 2005). Another interesting issue about 
evolutionary patterns of the mitochondrial genome is the 
codon usage bias in its protein-coding genes. Many 
studies have analyzed the differences in codon usage 
between different organisms and between proteins within 
the same organism (Bulmer, 1991; Kanaya et al, 2001; 
Sharp et al, 1995) and one of their main conclusions is 
that there are strong correlations between codon usage 
and genomic GC content. Moreover, human codon usage 
may be determined solely by GC content and its 
isochores composition (Kanaya et al, 2001). Despite 
suggestive evidence and hypothesis, the relationships 
between codon usage and the underlying evolutionary 
constraints are still not fully understood (Prat et al, 2009). 

Since the birth of molecular evolutionary analysis, 
the evolutionary relationships of our own order, primates, 
have been of central interest. Strangely, no comprehensive 
and accurate evaluation for the nucleotide substitution 
pattern(s) of the various mtDNA genes (i.e., tRNA and 
rRNA genes, synonymous and nonsynonymous positions 
of protein-coding genes) have been conducted to date. 
Most primate studies demonstrated mtDNA evolutionary 
neutrality and no recombination, and generally reported 
mtDNA still evolves about 5- to 10-fold more rapidly 
than single-copy nuclear DNA (nDNA) (Brown et al, 
1982) at an overall nucleotide substitution rate of 1% per 
million years (Brown et al, 1979; Wilson et al, 1985). 

As large libraries of whole-mtDNA genome 
sequences from primates accumulate, we are able to 
conduct more sophisticated genome-wide analyses to 
obtain solid information on the pattern of whole-mtDNA 
genome evolution in primates while also determining the 
relative evolutionary rate of each mitochondrial 
component. Here, we report on the rates and patterns of 
mtDNA sequence variation and the codon usage in 
primates and describe the results of tests conducted to 
detect recombination and selection. These results will not 
only be of interest in their own right but will also have 
implications for the use of mtDNA in evolutionary 
studies of primates. 
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1 Materials and Methods 


1.1 Sequences 

54 Mitogenomic sequences from 44 primate species 
were downloaded from GenBank (Table 1). All 54 
sequences were aligned, edited, and compared using the 
Sequencher 4.5 (Gene Codes Corporation, Ann Arbor, 
MI). We excluded all gaps and ambiguous alignment 
sites, resulting in 14,764 bp of sequence from the 13 
protein-coding, 2 rRNA, and 21 of the 22 tRNA genes. 
Protein-coding sequences from the L-strand-encoded 
genes (ND6 and 8 tRNA genes) were converted into 
complementary strand sequences. The tRNA-Glu gene 
and the Control Region were not included because the 
alignments of those sequences are difficult due to a large 
number of highly variable sites, insertions, and deletions. 
1.2 Phylogenetic analyses and sequence diversity 

The genealogical relationships between the 
mitochondrial genomes were analyzed by parsimony 
using PAUP* (Swofford, 2002). Bootstrapping (Felsenstein, 
1985) was used to test monophyly. For this study, 1,000 
pseudosamples were generated to estimate the bootstrap 
proportions. 

Nucleotide diversity (mz) was calculated for all 13 
protein-coding, 2 rRNA, and 21 concatenated tRNA 
genes. Measures of sequence diversity were also 
calculated by the sliding window method using the 
DnaSP 4.10 (Rozas et al, 2003). A window (500 bp in 
length) was moved along the sequences in steps of 50 bp. 
Patterns of nucleotide diversity for all positions as well 
as patterns of synonymous and nonsynonymous 
nucleotide variation were calculated in each window, and 
the value was assigned to the nucleotide at the midpoint 
of the window. The gamma parameter alpha (a), which 
represents the extent of rate heterogeneity among sites, 
was estimated from the individual data sets with TREE- 
PUZZLE 5.2 (Schmidt et al, 2002) using the 8-categories 
option. 

1.3 Codon usage bias 

Several parameters related to codon usage bias, 
such as the codon bias index (Morton, 1993), the 
effective number of codons (Wright, 1990), and G+ C 
content at second and third positions as well as overall 
were estimated for each primate mitochondrial protein- 
coding genes using DnaSP 4.10 (Rozas et al, 2003). 
Furthermore, to determine whether the compositional 
changes of the nucleotide content in the primate 
mitochondrial protein-coding genes are caused by 
directional mutation pressure or a result of positive 
selection, we performed a correlation analysis. If the 
nucleotide bias affects both the synonymous and 
nonsynonymous sites in protein-coding genes, positive 
selection could not solely explain this phenomenon, 
because positive selection should not affect silent 
nucleotide positions. The correlation analysis of GC 
content at the second codon position (nonsynonymous 
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Table 1 Primate taxa and GenBank accession numbers for mitochondria genome genes. 
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Family Taxon Genbank Family Taxon GonDank 
Accession No. Accession No. 
Galagidae Galago senegalensis AB371092 Presbytis melalophos DQ355299 
Otolemur crassicaudatus AB371093 Pygathrix nemaeus DQ355302 
Lorisidae Nycticebus coucang AJ309867 Pygathrix roxellana DQ355300 
Loris tardigradus AB371094 Nasalis larvatus DQ355298 
Perodicticus potto AB371095 Chlorocebus pygerythrus 1 EF597501 
Daubentoniidae Daubentonia madagascariensis 1 AM905039 Chlorocebus pygerythrus 2 EF597500 
Daubentonia madagascariensis 2 AB371085 Cercopithecus aethiops sabaeus DQ069713 
Indriidae Propithecus verreauxi AB286049 Papio hamadryas Y18001 
Lemuridae Varecia variegata AB371089 Theropithecus gelada FJ785426 
Lemur catta AJ421451 Macaca sylvanus AJ309865 
Eulemur macaco AB371088 Macaca thibetana EU294187 
Eulemur mongoz AM905040 Macaca mulatta AY612638 
Eulemur fulvus mayottensis AB371087 Macaca fascicularis FJ906803 
Eulemur fulvus fulvus AB371086 Hylobatidae Hylobates lar X99256 
Tarsiidae Tarsius syrichta AB371090 Pongo pygmaeus D38115 
Tarsius bancanus AF348159 Pongo abelii X97707 
Cebidae Saguinus oedipus FJ785424 Gorilla gorilla 1 D38114 
Cebus albifrons AJ309866 Gorilla gorilla 2 X93347 
Aotidae Aotus lemurinus FJ785421 Pan troglodytes 1 EU095335 
Atelidae Ateles belzebuth FJ785422 Pan troglodytes 2 D38113 
Pitheciidae Callicebus donacophilus FJ785423 Pan troglodytes 3 X93335 
Cebidae Saimiri sciureus 1 AB371091 Pan paniscus D38116 
Saimiri sciureus 2 FJ785425 Homo sapiens 1 AM948965 
Cercopithecidae Procolobus badius DQ355301 Homo sapiens 2 X93334 
Colobus guereza AY863427 Homo sapiens 3 D38112 
Semnopithecus entellus DQ355297 Homo sapiens 4 AM711903 
Trachypithecus obscurus AY863425 Homo sapiens 5 AF346992 


mutations related, GC2) and GC content at third position 
(synonymous mutations related, GCs3) with CG content 
of all protein codon genes (GCc) were implemented 
using R 2.6.2 (Ihaka & Gentleman, 1996). 
1.4 Tests of neutrality 

To investigate protein evolution, we first calculated 
dn/ds ratios among the set of 54 primate data and tested 
their significance using Z-tests (Nei & Kumar, 2000) for 
each of the 13 protein-coding genes as implemented by 
MEGA 4.0 (Tamura et al, 2007). We also tested for site- 
specific positive selection with Bayesian posterior 
probabilities under the Ny98 fitness regime (0<@, <1, 
@2=1, and œ; >1) (Nielsen & Yang, 1998) as described 
and implemented by MrBayes 3.1.2 (Huelsenbeck & 
Ronquist, 2001; Ronquist & Huelsenbeck, 2003). 
1.5 Tests of recombination 

We calculated the Kelly (1997) measure of linkage 
disequilibrium (ZnS); ZnS being the average of R? (Hill 
& Robertson, 1968) over all pairwise comparisons. We 
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also employed the pairwise homoplasy test ( Øw ), a 
relatively new and robust statistical estimator of 
recombination (Bruen et al, 2006) that is a coalescent- 
based estimator of the genealogical correlation or 
compatibility among sites, which is negatively correlated 
to recombination. The estimator was calculated, and its 
significance was estimated via a permutation test using 
Splitstree 4.10 (Huson & Bryant, 2006). 


2 Results 


2.1 Phylogenetic analyses and sequence diversity 

The parsimony tree for primates, based on the 
mitochondrial genome, is shown in Figure 1. Each 
internal node was supported by more than 80% bootstrap 
(data not shown). The mtDNA polymorphism data and 
estimates of nucleotide divergence are summarized in 
Table 2, and the distributions of sequence diversity 
across the mitochondrial genome are presented in Figure 
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Figure 1 Parsimony tree of 54 primates based on mitochondrial genome (Hi~ He were selected nodes for recombination tests) 


2. An obvious decline in primate nucleotide diversity was 
noted in the subunit rRNAs and tRNAs as compared to 
the protein-coding genes. Among the 13 protein-coding 
genes, ATP8-ATP6 and the NADH dehydrogenase (ND) 
complex (with the exception of ND1) showed higher 
rates of sequence diversity, while the cytochrome 
oxidase (COX) complex exhibited lower diversity within 
Primates. Notably, the nonsynonymous divergence plots 
were similar to those obtained from the total nucleotide 
divergence, but the trend was not observed in the sliding- 
window plot of synonymous divergence. Although the 
COX complex exhibited an obvious decline in total 
nucleotide divergence and nonsynonymous divergence, 
these genes had higher rates of synonymous diversity. 
Based on the estimated gamma parameter alpha (a), the 
rate heterogeneity among sites was not even, but the 
pattern of heterogeneity among sites was distributed 
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across the primate mitochondrial genome. 
2.2 Codon usage 

Several parameters related to codon usage bias were 
estimated to check whether synonymous mutations are 
selectively neutral. The codon bias index (CBI) is a 
measure of the deviation from the equal use of 
synonymous codons with values ranging from 0 (uniform 
use of synonymous codons) to 1 (maximum codon bias). 
CBI values were intermediate in our study, ranging from 
0.354 to 0.504 (Table 3). The effective number of codons 
(ENC), which may range from 20 (only one codon is 
used for each amino acid; i.e., the codon bias is 
maximum) to 61 (all synonymous codons for each amino 
acid are equally used; i.e., no codon bias), were likewise 
intermediate in the primates mitochondrial protein 
condoning genes (Table 3). GC content at the second 
position (GC2) ranged from 38.0% to 40.4%, GC content 
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Table 2 Characteristics for each gene data set in the 54 
Mitogenomic sequences from 44 primate species 





Gene data set N Tiotal Tsyn. Tnonsyn. a 
Protein genes 
NDI 954 0.22594 0.56062 0.10797 0.6400 
ND2 1032 0.28488 0.55369 0.19488 0.6829 
COX1 1533 0.19501 0.65372 0.04446 0.6678 
COX2 678 0.24309 0.62540 0.11977 0.6908 
ATP8- ATP6 834 0.28099 0.59298 0.17451 0.8017 
COX3 783 0.21378 0.60615 0.08494 0.7019 
ND3 342 0.26373 0.58340 0.15246 0.9531 
ND4L 294 0.25770 0.61033 0.13563 0.6112 
ND4 1359 0.25191 0.56225 0.14085 0.6478 
NDS 1788 0.25525 0.56755 0.14766 0.7309 
ND6 486 0.25715 0.50608 0.17487 0.6829 
Cyt b 1134 0.22837 0.57817 0.11319 0.7729 
pacer 11217 0.24287 0.58072 0.13166 0.7410 
RNA genes 
12s rRNA 864 0.15497 0.6910 
16s rRNA 1424 0.16676 0.6700 
21 concatenated tRNA 
genes (excluding 1259 0.13533 0.5689 
lysine tRNA) 
All genes 14764 0.22121 0.7229 


N: Numberof sites; Mora: nucleotide diversity for all sites; syn: nucleotide 
diversity for synonymous sites; Znonsyn.: nucleotide diversity for nonsynonymous 
sites; a: rate heterogeneity index / Gamma parameter. 


at third codon position (GCs) ranged from 30.7% to 
50.4%, and overall CG content of the protein codon 
genes (GCc) ranged from 37.4% to 46.5%. Correlation 
analysis indicates that the nucleotide bias affects both the 
synonymous and nonsynonymous sites in protein-coding 
genes. Likewise, there is a highly significant correlation 
between the GC3s and GCc (R? =0.9932, P < 0.001; 
Figure3 A). The correlation between GC2 and GCc is 
also highly significant (R?= 0.7772, P < 0.001; Figure 3B). 
2.3 Neutrality Tests 

As a conservative measure of selection, Z-tests were 
conducted for each protein-coding gene in all species. In 
all cases, dn<ds was highly significant (P = 0), consistent 
with strong purifying selection of mitochondrial genes in 
these comparisons. The dy/ds ratios differed among 
genes (Figure 4) and these differences may potentially be 
attributable to positive selection or variation in levels of 
functional constraint among genes. The ATP8, ND2, and 
ND6 genes were characterized by a dn/ds ratio well 
above the average, while the COX1 gene ratios were the 
lowest among the species. Site-specific positive selection 
inferred from Bayesian posterior probabilities indicated 
that no sites were under positive selection for any gene 
of any of the species. 
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Table 3 Summary of codon usage index of 13 concatenated 


protein genes from 44 Primates species 

Mitogenomic sequences 

Organism ENC CBI GCz GCs GCc 
Galago senegalensis 44.147 0.406 0.393 0.390 0.412 
Otolemur crassicaudatus 43.363 0.428 0.399 0.449 0.441 
Nycticebus coucang 41.565 0.470 0.397 0.400 0.423 
Loris tardigradus 46.337 0.363 0.394 0.386 0.413 
Perodicticus potto 42.444 0.461 0.390 0.348 0.394 
Daubentonia madagascariensis 1 44.075 0.399 0.383 0.363 0.402 
Propithecus verreauxi 43.910 0.406 0.399 0.417 0.420 
Varecia variegata 44.723 0.389 0.389 0.327 0.388 
Lemur catta 45.347 0.383 0.394 0.373 0.407 
Eulemur macaco 44.925 0.383 0.393 0.381 0.410 
Eulemur mongoz 45.215 0.380 0.394 0.382 0.410 
Eulemur fulvus mayottensis 45.045 0.396 0.391 0.331 0.388 
Eulemur fulvus fulvus 43.987 0.406 0.390 0.346 0.392 
Tarsius syrichta 45.321 0.395 0.392 0.348 0.399 
Tarsius bancanus 45.541 0.364 0.388 0.338 0.393 
Saguinus oedipus 42.664 0.409 0.383 0.339 0.389 
Cebus albifrons 42.905 0.398 0.385 0.307 0.379 
Aotus lemurinus 44.815 0.367 0.385 0.334 0.390 
Ateles belzebuth 44.227 0.385 0.385 0.325 0.388 
Callicebus donacophilus 44.608 0.367 0.387 0.331 0.391 
Saimiri sciureus 1 44.552 0.368 0.387 0.331 0.390 
Procolobus badius 45.041 0.420 0.401 0.458 0.441 
Colobus guereza 44.081 0.427 0.397 0.447 0.434 
Semnopithecus entellus 44.262 0.415 0.403 0.428 0.429 
Trachypithecus obscurus 46.006 0.394 0.382 0.328 0.388 
Presbytis melalophos 46.065 0.357 0.389 0.379 0.407 
Pygathrix nemaeus 46.553 0.373 0.387 0.321 0.384 
Pygathrix roxellana 46.949 0.354 0.387 0.333 0.387 
Nasalis larvatus 45.514 0.394 0.384 0.311 0.380 
Chlorocebus pygerythrus 1 44.783 0.415 0.400 0.434 0.431 
Cercopithecus aethiops sabaeus 44.474 0.426 0.399 0.439 0.436 
Papio hamadryas 44.504 0.425 0.396 0.440 0.434 
Theropithecus gelada 44.556 0.426 0.400 0.442 0.436 
Macaca sylvanus 44.741 0.434 0.400 0.438 0.434 
Macaca thibetana 45.325 0.375 0.387 0.338 0.389 
Macaca mulatta 45.586 0.377 0.380 0.307 0.374 
Macaca fascicularis 46.576 0.374 0.387 0.324 0.385 
Hylobates lar 43.281 0.457 0.397 0.498 0.461 
Pongo pygmaeus 43.111 0.436 0.397 0.455 0.442 
Pongo abelii 43.123 0.435 0.398 0.454 0.443 
Gorilla gorilla 1 43.177 0.435 0.398 0.454 0.442 
Pan troglodytes 1 44.396 0.427 0.394 0.456 0.443 
Pan paniscus 41.489 0.504 0.404 0.504 0.465 
Homo sapiens 1 42.622 0.457 0.397 0.474 0.451 


ENC: Effective number of codons (Wright, 1990); CBI: Codon bias index 
(Morton, 1993); GC2: GC content at the second position; GCs3: GC content 


at third codon position; GCc: overall CG content. 
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Figure 2 Sequence diversity of the mitochondrial genome by 
nucleotide position 
Diversity measure was calculated for a sliding window of 500 
bp with a step size of 50 bp. (A) Patterns of total nucleotide 
diversity ( 7 wal) for all positions; (B) patterns of 
nonsynonymous nucleotide diversity (Znonsyn.) for concatenated 
protein genes; (C) patterns of synonymous nucleotide diversity 


(ayn.) for concatenated protein genes. 


2.4 Tests of Recombination 

The measures of linkage disequilibrium are 
summarized in Table 3. The average R? (Hill & 
Robertson 1968) across all pairwise comparisons (ZnS) 
for each individual data set and the pooled data provided 
evidence of recombination between relatively distant 
divergent species but not for more recent divergent 
species. The pairwise homoplasy test statistic (Mw) for 
the pooled data also provided similar results to the ZnS- 
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Figure 3 Correlation of GC content at the third codon position 
(GC33), and GC content at the second position (GC2) 
with overall CG content (GCc) 

(A) Correlation of GCs and GCc of coding sequences; (B) 


Correlation of GC2 and GCc of coding sequences. 


test, but the ® w test statistic provided no evidence of 
recombination for each individual data set. Based on our 
data sets, the assumption of complete linkage of the 
mtDNA genome appears to be justified only between 
more recently divergent species. 


3 Discussion 


Primate mitochondrial genomes exhibit striking 
patterns of both polymorphism and substitution rate 
heterogeneity among sites and genes, as evidenced by 
sliding-window estimates of sequence diversity, 
nonsynonymous and synonymous divergence, and the 
estimated gamma parameter alpha (a) (Table 2, Figure 2). 
Less primate nucleotide diversity was observed in the 
subunit rRNAs and tRNAs than that in the protein- 
coding genes. Among the 13 coding genes, rates were 
higher in the ND genes and the ATP8- ATP6 gene and 
lower in the COX complex genes, consistent with the 


Www.zoores.ac.cn 


Preliminary analysis of the mitochondrial genome evolutionary pattern in primates E53 





NDI ND2 COXI COX2 ATP8 ATP6 COX3 ND3 ND4L ND4 NDS ND6 CYTB 


Figure 4 Ratios of nonsynonymous/synonymous rate per gene in the mitochondrial genomes of 54 primates. 


Dotted line indicates the average ratio. 


Table 4 Recombination test for primate mitochondrial genomes. 


Internal subclades (node) 











Recombin- 
Gene x 
ation test Hı Hə H; H4 Hs He 
NDI ZnS 0.0405** 0.0458** 0.0588** 0.0770** 0.2884 0.7266 
Dy 1.2488 1.0927 0.8219 0.6476 0.1496 0.0659 
ND2 ZnS 0.0489** 0.0566** 0.0699** 0.0810* 0.2874 0.7251 
Ow 1.4217 1.2213 0.9230 0.7143 0.1779 0.0579 
COX1 ZaS 0.0424** 0.0487** 0.0611** 0.0784** 0.2823 0.7544 
Oy 1.6117 1.4236 1.1128 0.8265 0.1662 0.0345 
COx2 ZnS 0.0795** 0.0874** 0.1147 0.0947* 0.2966 0.7880 
Ow 3825 1.1986 0.9053 0.8271 0.1300 0.0000 
ZnS 0.0479** 0.0558** 0.0696** 0.0845** 0.3012 0.6755 
ATP8-ATP6 

Dw 4795 1.2841 0.9917 0.8155 0.1424 0.0772 
COX3 ZnS 0.0407** 0.0553** 0.0575** 0.0730** 0.2273 0.7828 
Dy 4295 1.2397 0.9946 0.7820 0.2722 0.0029 
ND3 ZnS 0.0375** 0.0493 ** 0.0610** 0.0790* 0.2893 0.7436 
Ow 5716 1.3769 0.9752 0.7419 0.2004 0.0751 
NDAL ZnS 0.0406** 0.0489** 0.0628** 0.0676** 0.2877 0.7621 
Dy 3633 1.2171 0.9726 0.7554 0.1937 0.0811 
ND4 ZnS 0.0445** 0.0513** 0.0579** 0.0787* 0.2811 0.7561 
Ow 4206 1.2099 0.9199 0.7240 0.1476 0.0101 
NDS ZnS 0.0401** 0.0458** 0.0608** 0.0770** 0.2812 0.7088 
Dy 3903 1.2037 0.9311 0.7230* 0.1596 0.0472 
ND6 ZnS 0.0644* 0.0726* 0.0815* 0.0750** 0.2754 0.7926 
Dy 1.264 1.1078 0.9061 0.7237 0.1910 0.0163 
CYTB ZnS 0.0381** 0.0430** 0.0545** 0.0683* 0.2680 0.7586 
Ow 1.3633 1.2238 0.9792 0.7067 0.2383 0.0398* 
AILP ZnS 0.0442** 0.0504** 0.0640** 0.0770** 0.2835 0.7394 
Ea Ow 1.2291** 1.0311** 0.9620** 0.7483 0.1691 0.0411 


* indicates p<0.05; ** indicates p<0.01 


results from various mammals (Kumazawa et al, 2004). 
Most notably, the nonsynonymous divergence plots were 
similar to those obtained from the pooled data nucleotide 
divergence, while the synonymous changes were only 
slightly different between individual genes of mtDNA 
(Figure 2). Accordingly, rate heterogeneity may result 
from the rate of change at nonsynonymous sites. 

The observed pattern of heterogeneity across the 
mitochondrial genome was similar to the pattern of 
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mitochondrial genome evolution at the intraspecific level 
in humans (Elson et al, 2004) and both the inter- and 
intraspecific levels in Drosophila lard (Ballard, 2000), 
where the synonymous distribution was more 
homogeneous and the nonsynonymous distribution was 
more heterogeneous. Additionally, the results obtained 
here differed from the inter- and intraspecific studies 
based on whole genome in Gadus morhua and its relative 
species (Marshall et al, 2009). Interestingly, all 
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mitochondrial genome comparisons mentioned above 
showed significant rate heterogeneity and site specific 
hyper-mutability. Possible explanations for rate 
heterogeneity in the mitochondrial genome include 
selection pressure, variation in the mutation rate of each 
gene (involving base composition bias and properties of 
the L-strand during mtDNA replication) (Marshall et al, 
2009), and the consequence of neutral mutational effects 
(Galtier et al, 2006). The detected evolutionary patterns 
of the mitochondrial genome will influence the choice of 
mtDNA regions used for phylogeographic studies, since 
the more variable regions of the genome harbor excess 
homoplasy relative to the more slowly evolving regions 
(Galtier et al, 2006). Marshall et al (2009) proposed that 
the more variable regions of the mitochondrial genome 
harbor excess homoplasy relative to the more slowly 
evolving regions, so choosing to examine more slowly 
evolving regions for evolutionary analysis among species, 
and even within species, with an increase in the sequence 
length may be wiser. Based on the primate mitochondrial 
genome comparison in this study, we suggest that the 
subunit rRNAs genes, tRNAs genes, and the COX 
complex genes may be promising sites for future 
investigations of primate phylogeography. 

CBIs estimated in this study revealed that there was 
intermediate codon usage bias in primate protein-coding 
genes (Table.3). Several evolutionary processes have 
been postulated as the major factors that determine 
codon usage. For a number of different organisms, it was 
suggested that codon usage was best explained by 
selection for tRNA abundance, gene expression levels, 
and translational optimization (Duret, 2000). In other 
cases, the codon usage was explained by mutation rate, 
mutation preference (Powell & Moriyama, 1997), 
environmental conditions (Goodarzi et al, 2008), 
generation time (Subramanian, 2008), and recombination 
rates (Meunier & Duret, 2004). Although the 
relationships between codon usage and the underlying 
evolutionary constraints are still not fully understood, 
two major paradigms of codon usage have been found in 
most species (Prat et al, 2009). Natural selection is 
expected to yield a correlation with codon bias in more 
highly expressed genes. Alternatively, codon bias could 
be generated by strictly neutral processes, such as 
mutational biases, where some mutations occur more 
than others across the genome by local variations in the 
base composition (Duret, 2002). Correlation analysis 
indicates that the nucleotide bias affects both the 
synonymous and nonsynonymous sites in primate 
protein-coding genes. There is a highly significant 
correlation between the GC3s and GCc (R? =0.9932, 
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Figure 3A) and the correlation between GC2 and GCc is 
also highly significant (R? =0.7772, Figure 3B). Because 
positive selection should not affect silent nucleotide 
positions, our results support the idea that adaptive 
evolution is not the driving force for the codon usage 
bias, indicating that GC mutation pressure may 
determine codon usage in mitochondrial genomes of 
different primate species. 

Using a conservative measure of selection, Z-tests 
indicated that dv<ds was highly significant (P = 0) in all 
cases; this result is consistent with purifying selection of 
mitochondrial genes. A similar result was first found 
during analysis of the mitochondrial genomes of humans, 
chimpanzees, and gorillas (Hasegawa et al, 1998) and 
also reported in additional studies on fruit flies, humans 
and Atlantic cod (Ballard, 2000; Elson et al, 2004; 
Marshall et al, 2009). However, site-specific positive 
selection from a Bayesian perspective indicates that no 
sites are under positive selection for any gene of any of 
the species, indicating that most mutations are nearly 
neutral or slightly deleterious, a pattern consistent with 
the nearly neutral theory of molecular evolution 
proposed by Ohta (1973). Our results therefore indicate 
that the evolutionary pattern of primate mitochondrial 
genomes and each single mitochondrial gene did not 
severely deviate from evolutionary neutrality. 

A presumed absence of recombination is another 
key feature of mtDNA that makes it a useful 
phylogeographic molecule (Marshall et al, 2009). The 
uniparental inheritance of mtDNA indicates that 
recombination should be rare or absent, but a broad 
survey of 267 published animal mtDNA data sets by 
Piganeau et al(2004) provided indirect evidence of 
recombination in more than 14% of cases. Based on the 
pairwise comparisons (ZnS) and the pairwise homoplasy 
test statistic (Mw) for the 13 concatenated protein genes, 
the assumption of complete linkage of the mtDNA 
genome appears to be justified only between more 
recently divergent species. However, no evidence of 
recombination among the individual genes was detected 
with the pairwise homoplasy test statistic (®w) even 
among the more divergent species, indicating that 
recombination is perhaps a result of substitution rate 
heterogeneity in the primate mtDNA genomes. 
Additionally, as the lineage divergence increases, the 
divergence of each individual gene may produce 
artificial significant ZnS values based on all pairwise 
comparisons from the pooled data and may lead to false 
detection of recombination. Thus, it appears unnecessary 
to consider the effects of recombination on the evolution 
of mtDNA in primates. 
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